50 research outputs found

Direct Image to Point Cloud Descriptors Matching for 6-DOF Camera Localization in Dense 3D Point Cloud

Author: AR Zamir
DG Lowe
JL Schönberger
L Breiman
MA Fischler
N Snavely
PH Torr
S Lazebnik
T Sattler
X-S Gao
Y Li
Publication date: 14/06/2019

We propose a novel concept to directly match feature descriptors extracted from RGB images with feature descriptors extracted from 3D point clouds. We use this concept to localize the position and orientation (pose) of the camera of a query image in dense point clouds. We generate a dataset of matching 2D and 3D descriptors, and use it to train a proposed Descriptor-Matcher algorithm. To localize a query image in a point cloud, we extract 2D keypoints and descriptors from the query image. Then the Descriptor-Matcher is used to find the corresponding pairs of 2D and 3D keypoints by matching the 2D descriptors with the pre-extracted 3D descriptors of the point cloud. This information is used in a robust pose estimation algorithm to localize the query image in the 3D point cloud. Experiments demonstrate that directly matching 2D and 3D descriptors is not only a viable idea but also achieves competitive accuracy compared to other state-of-the-art approaches for camera pose localization.

arXiv.org e-Print Archive

Crossref

Learning and Matching Multi-View Descriptors for Registration of Point Clouds

Author: A.E. Johnson
Angela Dai
DG Lowe
Emanuele Rodolà
F Endres
F Fraundorfer
F Tombari
François Pomerleau
GF Cooper
H Huang
J Johnson
J Yang
JL Schönberger
KM Yi
MG Dissanayake
MJ Black
N Snavely
PJ Besl
Q-Y Zhou
R Raguram
S Li
T Shen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 15/07/2018

Critical to the registration of point clouds is the establishment of a set of accurate correspondences between points in 3D space. The correspondence problem is generally addressed by the design of discriminative 3D local descriptors on the one hand, and the development of robust matching strategies on the other hand. In this work, we first propose a multi-view local descriptor, which is learned from the images of multiple views, for the description of 3D keypoints. Then, we develop a robust matching approach that rejects outlier matches through efficient inference via belief propagation on the defined graphical model. We demonstrate that our approaches improve registration on public scanning and multi-view stereo datasets. The superior performance is verified by intensive comparisons against a variety of descriptors and matching methods.

arXiv.org e-Print Archive

Crossref

Polarimetric Multi-View Inverse Rendering

Author: A Ghosh
A Ley
C Wu
CP Huynh
D Kiku
D Miyazaki
D Miyazaki
DG Lowe
GA Atkinson
GA Atkinson
GA Atkinson
GG Stokes
H Aanæs
H Bay
J Biehler
J Park
JL Schönberger
JT Barron
K Kim
M Kazhdan
M Li
R Zhang
S Ikehata
S Mihoubi
SH Baek
W Smith
Y Furukawa
Y Maruyama
Y Xiong
Publication date: 17/07/2020

A polarization camera has great potential for 3D reconstruction since the angle of polarization (AoP) of reflected light is related to an object's surface normal. In this paper, we propose a novel 3D reconstruction method called Polarimetric Multi-View Inverse Rendering (Polarimetric MVIR) that effectively exploits geometric, photometric, and polarimetric cues extracted from input multi-view color polarization images. We first estimate camera poses and an initial 3D model by geometric reconstruction with a standard structure-from-motion and multi-view stereo pipeline. We then refine the initial model by optimizing photometric and polarimetric rendering errors using multi-view RGB and AoP images, where we propose a novel polarimetric rendering cost function that enables us to effectively constrain each estimated surface vertex's normal while considering four possible ambiguous azimuth angles revealed from the AoP measurement. Experimental results using both synthetic and real data demonstrate that our Polarimetric MVIR can reconstruct a detailed 3D shape without assuming a specific polarized reflection depending on the material. Comment: Paper accepted in ECCV 202

arXiv.org e-Print Archive

Crossref

Single-Image Depth Prediction Makes Feature Matching Easier

Author: A Criminisi
A Criminisi
AJ Davison
B Zeisl
C Wu
D Gálvez-López
D Liebowitz
D Mishkin
DG Lowe
ES Jones
G Baatz
G Baatz
G Simon
H Aanæs
J Matas
J Pritts
JL Schönberger
JM Morel
K Cordes
K Mikolajczyk
K Mikolajczyk
L Svärm
MA Fischler
R Garg
R Mur-Artal
S Hinterstoisser
T Lindeberg
T Sattler
W Liu
W Maddern
Y Pang
Publication date: 01/01/2020

Good local features improve the robustness of many 3D re-localization and multi-view reconstruction pipelines. The problem is that viewing angle and distance severely impact the recognizability of a local feature. Attempts to improve appearance invariance by choosing better local feature points or by leveraging outside information have come with prerequisites that made some of them impractical. In this paper, we propose a surprisingly effective enhancement to local feature extraction, which improves matching. We show that CNN-based depths inferred from single RGB images are quite helpful, despite their flaws. They allow us to pre-warp images and rectify perspective distortions, significantly enhancing SIFT and BRISK features and enabling more good matches, even when cameras are looking at the same scene but in opposite directions. Comment: 14 pages, 7 figures, accepted for publication at the European conference on computer vision (ECCV) 202

arXiv.org e-Print Archive

Crossref

UCL Discovery

Chalmers Research

Capture, Reconstruction, and Representation of the Visual Real World for Virtual Reality

Author: A Collet
A Dai
A Davis
A Dosovitskiy
A Meka
A Parra Pozo
A Serrano
B Luo
B Mildenhall
C Keysers
C Kim
C Lipski
C Schroers
C Weissig
D Lanman
E Penner
F Perazzi
F Prada
G Chaurasia
G Chaurasia
G Nam
G Wetzstein
G Wetzstein
GA Koulieris
H Hukkelås
H Ishiguro
H Kim
H Rhodin
HY Shum
J Lee
J Thies
J Ventura
J Zaragoza
JL Schönberger
K Yücer
M Mori
M Nießner
M Tatarchenko
M Zollhöfer
N Snavely
NK Kalantari
P Hedman
P Hedman
P Hedman
P Hedman
P Moulon
R Anderson
R Konrad
R Martin-Brualla
R Szeliski
R Szeliski
RS Overbeck
S Lombardi
S Niklaus
S Peleg
S Tulsiani
SE Wei
SMA Eslami
T Bertel
T Whelan
T Zhou
T Zhou
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 31/03/2020

We provide an overview of the concerns, current practice, and limitations for capturing, reconstructing, and representing the real world visually within virtual reality. Given that our goals are to capture, transmit, and depict complex real-world phenomena to humans, these challenges cover the opto-electro-mechanical, computational, informational, and perceptual fields. Practically producing a system for real-world VR capture requires navigating a complex design space and pushing the state of the art in each of these areas. As such, we outline several promising directions for future work to improve the quality and flexibility of real-world VR capture systems.

OPUS

Crossref

The Glasgow Outcome Scale -- 40 years of application and refinement

Author: A Ardolino
A Nichol
A Ponce
AA Garner
AD Mendelow
AD Nicoll
AI Maas
AI Maas
AI Maas
AS Alali
AV Ciurea
B Gabbe
B Jennett
B Jennett
B Jennett
B Jennett
B Roozenbeek
B Roozenbeek
B Sharma
C Willmott
C-A Carlsson
CD Willis
D Barer
D Bates
DH Fiser
DH Fiser
DJ Cooper
DN Brooks
DP Becker
DS Tulsky
DW Wright
E Bagiella
EA Wilde
EL Yuh
EM Moore
FM Hammond
GL Clifton
GM Teasdale
GM Teasdale
Graham Teasdale
GS McHugh
Harvey Levin
HS Levin
J Emberson
J Lu
J Lu
J Ponsford
J Ponsford
J Ponsford
J Weir
Jennie Ponsford
JK Yue
JL Ponsford
JT Wilson
JT Wilson
JT Wilson
JT Wilson
K Draper
K Hall
K Millar
KM Barlow
KR Gould
L Mailhan
L Whitnall
LE Pettigrew
Lindsay Wilson
M Rappaport
M Schönberger
M Vapalahti
MA Foulkes
MD Lezak
Michael Bond
MR Bullock
MR Fearnside
MS Vavilala
N von Steinbuechel
NR Temkin
O Heiskannen
PS London
R Braakman
R Hicks
R Tate
RK Narayan
RL Tate
RL Tate
RL Wood
RM Chesnut
S Honeybul
S Saxena
S Thornhill
SA Bernard
SC Choi
SI Anderson
SR Beers
SR Millis
TM McMillan
TM McMillan
Tom McMillan
TW Langfitt
W Poon
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 15/07/2016

The Glasgow Outcome Scale (GOS) was first published in 1975 by Bryan Jennett and Michael Bond. With over 4,000 citations to the original paper, it is the most highly cited outcome measure in studies of brain injury and the second most-cited paper in clinical neurosurgery. The original GOS and the subsequently developed extended GOS (GOSE) are recommended by several national bodies as the outcome measure for major trauma and for head injury. The enduring appeal of the GOS is linked to its simplicity, short administration time, reliability and validity, stability, flexibility of administration (face-to-face, over the telephone and by post), cost-free availability and ease of access. These benefits apply to other derivatives of the scale, including the Glasgow Outcome at Discharge Scale (GODS) and the GOS paediatric revision. The GOS was devised to provide an overview of outcome and to focus on social recovery. Since the initial development of the GOS, there has been an increasing focus on the multidimensional nature of outcome after head injury. This Review charts the development of the GOS, its refinement and usage over the past 40 years, and considers its current and future roles in developing an understanding of brain injury.

Crossref

Stirling Online Research Repository (RIOXX)

Enlighten

Stirling Online Research Repository

Corporations and Citizenship Arenas in the Age of Social Media

Crossref

Learning to solve nonlinear least squares for monocular stereo

Author: A Tikhonov
CB Choy
CT Kelley
J Engel
JL Schönberger
O Öktem
R Fletcher
S Hochreiter
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 08/09/2018

Sum-of-squares objective functions are very popular in computer vision algorithms. However, these objective functions are not always easy to optimize. The underlying assumptions made by solvers are often not satisfied and many problems are inherently ill-posed. In this paper, we propose a neural nonlinear least squares optimization algorithm which learns to effectively optimize these cost functions even in the presence of adversities. Unlike traditional approaches, the proposed solver requires no hand-crafted regularizers or priors as these are implicitly learned from the data. We apply our method to the problem of motion stereo, i.e., jointly estimating the motion and scene geometry from pairs of images of a monocular sequence. We show that our learned optimizer is able to efficiently and effectively solve this challenging optimization problem.

Crossref

Spiral - Imperial College Digital Repository

Handcrafted Outlier Detection Revisited

Author: B Thomee
DG Lowe
H Durrant-Whyte
H Jegou
J Cech
J Ma
J Sivic
JL Schönberger
JL Schönberger
MA Fischler
O Chum
O Chum
PH Torr
R Raguram
R Ranftl
RI Hartley
S Ullman
T Bailey
W-Y Lin
WY Lin
Y Avrithis
Y Li
Z Dang
Z Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020

Local feature matching is a critical part of many computer vision pipelines, including, among others, Structure-from-Motion, SLAM, and Visual Localization. However, due to limitations in the descriptors, raw matches are often contaminated by a majority of outliers. As a result, outlier detection is a fundamental problem in computer vision, and a wide range of approaches, from simple checks based on descriptor similarity to geometric verification, have been proposed over the last decades. In recent years, deep learning-based approaches to outlier detection have become popular. Unfortunately, the corresponding works rarely compare with strong classical baselines. In this paper we revisit handcrafted approaches to outlier filtering. Based on best practices, we propose a hierarchical pipeline for effective outlier detection and integrate novel ideas which together lead to an efficient and competitive approach to outlier rejection. We show that our approach, although not relying on learning, is more than competitive with both recent learned works and handcrafted approaches, in terms of both efficiency and effectiveness. The code is available at https://github.com/cavalli1234/AdaLAM

Crossref

Chalmers Research

Efficient Neighbourhood Consensus Networks via Submanifold Sparse Convolutions

Author: AR Widya
C Schmid
D Marr
DG Lowe
F Schaffalitzky
H Bay
H Zhou
JL Schönberger
K Lenc
K Mikolajczyk
K Mikolajczyk
KI Mori
KM Yi
S Oron
XS Gao
Z Zhang
Publication venue: HAL CCSD
Publication date: 22/04/2020

In this work we target the problem of estimating accurately localised correspondences between a pair of images. We adopt the recent Neighbourhood Consensus Networks that have demonstrated promising performance for difficult correspondence problems and propose modifications to overcome their main limitations: large memory consumption, large inference time and poorly localised correspondences. Our proposed modifications can reduce the memory footprint and execution time more than $10\times$, with equivalent results. This is achieved by sparsifying the correlation tensor containing tentative matches, and its subsequent processing with a 4D CNN using submanifold sparse convolutions. Localisation accuracy is significantly improved by processing the input images in higher resolution, which is possible due to the reduced memory footprint, and by a novel two-stage correspondence relocalisation module. The proposed Sparse-NCNet method obtains state-of-the-art results on the HPatches Sequences and InLoc visual localisation benchmarks, and competitive results in the Aachen Day-Night benchmark.

arXiv.org e-Print Archive

Crossref

INRIA a CCSD electronic archive server